Accommodating for 3D Head Movement in Visual Lipreading
نویسندگان
چکیده
In automatic lipreading, the speaker’s head movement can affect the mouth shape appearing in the captured images independently of the true mouth shape. Such distortion can lead to incorrect recognition of visual speech, thus this problem must be dealt with for a practical application. We have developed a system that accomodates the 3D head movement of the speaker in recognising the mouth shape of unadorned lips appearing in the image. This is achieved by tracking the 3D head movement from the 2D input and correcting the mouth shape that is detected from the image, in order to robustly recognise the visual phonemes. The system allows the speaker’s head to move and exhibit rotations of up to 30 degrees away from the camera. An experiment is presented, showing head movements in 3D while the speaker enunciates 5 phonemes that represent the various mouth shapes of visual speech. The result shows that the 3D head tracker robustly tracks the 3D head movement, and accomodating head pose to correct mouth shape is achieved effectively.
منابع مشابه
The UWB 3d talking head text-driven system controlled by the SAT method used for the LIPS 2009 challenge
This paper describes the 3D talking head text-driven system controlled by the SAT (Selection of Articulatory Targets) method developed at the University of West Bohemia (UWB) that will be used for participation in the LIPS 2009 challenge. It gives an overview of methods used for visual speech animation, parameterization of a human face and a tongue, and a synthesis method. A 3D animation model ...
متن کاملA 3D Head Tracker for an Automatic Lipreading System
A real world automatic lip reading system must be able to cope with movement of the speaker’s head during operation. The observed mouth shape depends not only on the true shape of the mouth, but also the angle at which the mouth is viewed. As the speaker’s head moves and rotates the viewing angle changes. The resulting distortion can lead to inaccurate mouth measurement and incorrect phoneme re...
متن کاملA 3D Head Tracker for an AutolTIatic Lipreading System
A real world automatic lip reading system must be able to cope with movement of the speaker's head during operation. The observed mouth shape depends not only on the true shape of the mouth, but also the angle at which the mouth is viewed. As the speaker's head moves and rotates the viewing angle changes. The resulting distortion can lead to inaccurate mouth measurement and incorrect phoneme re...
متن کاملA development of Czech talking head
This paper presents a research on the Czech talking head system. It gives an overview of methods used for visual speech animation, parameterization of a human face and a tongue, necessary data sources and a synthesis method. A 3D animation model is used for a pseudo-muscular animation schema to create such animation of visual speech which is usable for a lipreading. An extension of animation sc...
متن کاملUnderstanding the visual speech signal
For machines to lipread, or understand speech from lip movement, they decode lip-motions (known as visemes) into the spoken sounds. We investigate the visual speech channel to further our understanding of visemes. This has applications beyond machine lipreading; speech therapists, animators, and psychologists can benefit from this work. We explain the influence of speaker individuality, and dem...
متن کامل